Group Bitmap Index: A Structure for Association Rules Retrieval

نویسندگان

  • Tadeusz Morzy
  • Maciej Zakrzewicz
چکیده

Discovery of association rules from large databases of item sets is an important data mining problem. Association rules are usually stored in relational databases for future use in decision support systems. In this paper, the problem of association rules retrieval and item sets retrieval is recognized as the subset search problem in relational databases. The subset search is not well supported by SQL query language and traditional database indexing techniques. We introduce a new index structure, called Group Bitmap Index, and compare its performance with traditional indexing methods: B tree and bitmap indexes. We show experimentally that proposed index enables faster subset search and significantly outperforms traditional indexing methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bitmap Indexing-based Clustering and Retrieval of XML Documents

This paper describes a bitmap indexing based technique to cluster XML documents. XML documents can be hierarchically represented by elements. To improve performance of information retrieval, documents can be indexed using bitmap techniques. Such a bitmap index is sparse, meaning it contains unnecessarily many zero bits, especially for the word dimension. To remove zero bits and improve the perf...

متن کامل

BitCube: Clustering and Statistical Analysis for XML Documents

In this paper, we describe a new bitmap indexing technique to cluster XML documents. XML is a new standard for exchanging and representing information on the Internet. Documents can be hierarchically represented by XML-elements. XML documents are represented and indexed using a bitmap indexing technique. We define the similarity and popularity operations available in bitmap indexes and propose ...

متن کامل

Bitmap based algorithms for mining association rules

Discovery of association rules is an important problem in Data Mining. The classical approach is to generate all itemsets that have support (i.e., the fraction of transactions containing the itemset) above a user given threshold. Most existing algorithms aim at reducing the number of scans over the transaction database, i.e., the I/O overhead. We consider the problem of how to calculate efficie...

متن کامل

New Approach to Optimize the Time of Association Rules Extraction

The knowledge discovery algorithms have become ineffective at the abundance of data and the need for fast algorithms or optimizing methods is required. To address this limitation, the objective of this work is to adapt a new method for optimizing the time of association rules extractions from large databases. Indeed, given a relational database (one relation) represented as a set of tuples, als...

متن کامل

An Efficiently Algorithm for Mining Association Rules

—Association rules mining is one of the most important topic in data mining. A new algorithm for mining association rules is proposed in this paper. In data mining, the process of counting any itemset`s support requires a great I/O and computing cost. An impacted bitmap technique to speed up the counting process is employed in this paper. Nevertheless, saving the intact bitmap usually has a big...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998